智能论文笔记

AquaFeL-PSO: A Monitoring System for Water Resources using Autonomous Surface Vehicles based on Multimodal PSO and Federated Learning

Micaela Jara Ten Kathen , Princy Johnson , Isabel Jurado Flores , Daniel Guti errez Reina

分类：机器学习 | 机器人

2022-11-28

The preservation, monitoring, and control of water resources has been a major challenge in recent decades. Water resources must be constantly monitored to know the contamination levels of water. To meet this objective, this paper proposes a water monitoring system using autonomous surface vehicles, equipped with water quality sensors, based on a multimodal particle swarm optimization, and the federated learning technique, with Gaussian process as a surrogate model, the AquaFeL-PSO algorithm. The proposed monitoring system has two phases, the exploration phase and the exploitation phase. In the exploration phase, the vehicles examine the surface of the water resource, and with the data acquired by the water quality sensors, a first water quality model is estimated in the central server. In the exploitation phase, the area is divided into action zones using the model estimated in the exploration phase for a better exploitation of the contamination zones. To obtain the final water quality model of the water resource, the models obtained in both phases are combined. The results demonstrate the efficiency of the proposed path planner in obtaining water quality models of the pollution zones, with a 14$\%$ improvement over the other path planners compared, and the entire water resource, obtaining a 400$\%$ better model, as well as in detecting pollution peaks, the improvement in this case study is 4,000$\%$. It was also proven that the results obtained by applying the federated learning technique are very similar to the results of a centralized system.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

A Library for Representing Python Programs as Graphs for Machine Learning

David Bieber , Kensen Shi , Petros Maniatis , Charles Sutton , Vincent Hellendoorn , Daniel Johnson , Daniel Tarlow

分类：机器学习

2022-08-15

程序的图表通常是用于代码研究的机器学习的核心要素。我们介绍了一个开源Python库Python_graphs，该图片将静态分析应用于构建适合培训机器学习模型的Python程序的图表。我们的库承认控制流图，数据流图和复合``程序图''的构建，这些图形结合了控制流，数据流，句法和词汇信息。我们介绍了图书馆的功能和局限性，进行案例研究，将图书馆应用于数百万竞争性的编程提交，并展示图书馆的机器学习研究实用程序。

translated by 谷歌翻译

3D face reconstruction with dense landmarks

Erroll Wood , Tadas Baltrusaitis , Charlie Hewitt , Matthew Johnson , Jingjing Shen , Nikola Milosavljevic , Daniel Wilde , Stephan Garbin , Chirag Raman , Jamie Shotton

分类：计算机视觉

2022-04-06

地标通常在面部分析中起关键作用，但是仅凭稀疏地标就不能代表身份或表达的许多方面。因此，为了更准确地重建面，地标通常与其他信号（如深度图像或技术）相结合，例如可区分渲染。我们可以通过使用更多地标使事情变得简单吗？在答案中，我们提出了第一种准确地预测10倍地标的方法，覆盖整个头部，包括眼睛和牙齿。这是使用合成培训数据来完成的，该数据保证了完美的地标注释。通过将可变形的模型拟合到这些密集的地标，我们可以在野外实现单眼3D面重建的最新结果。我们表明，密集的地标是通过在单眼和多视图方案中展示准确和表现力的面部绩效捕获来整合跨帧面部形状信息的理想信号。这种方法也非常有效：我们可以预测密集的地标，并在单个CPU线程上以超过150fps的速度适合我们的3D面模型。请参阅我们的网站：https：//microsoft.github.io/denselandmarks/。

translated by 谷歌翻译

Learning Generalized Gumbel-max Causal Mechanisms

Guy Lorberbom , Daniel D. Johnson , Chris J. Maddison , Daniel Tarlow , Tamir Hazan

分类：机器学习 | (统计)机器学习

2021-11-11

为了在结构因果模型（SCM）中执行反事实推理，需要了解因果机制，它提供条件分布的因子，并将噪声映射到样本的确定性函数。遗憾的是，因象无法通过观察和与世界互动收集的数据唯一确定的因果机制，因此仍然存在如何选择因果机制的问题。最近的工作中，Oberst＆Sontag（2019）提出了Gumbel-Max SCM，它由于直观上吸引的反事实稳定性而导致Gumbel-Max Reparameterizations作为因果机制。在这项工作中，我们认为选择在估算反事实治疗效果时最小化的定量标准的因果机制，例如最小化方差。我们提出了一个参数化的因果机制，概括了Gumbel-Max。我们表明他们可以接受培训，以最大限度地减少对感兴趣查询的分布的反事实效果方差和其他损失，从而产生比固定替代方案的反复治疗效果的较低方差估计，也推广到在培训时间未见的查询。

translated by 谷歌翻译

MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence using Federated Evaluation

Alexandros Karargyris , Renato Umeton , Micah J. Sheller , Alejandro Aristizabal , Johnu George , Srini Bala , Daniel J. Beutel , Victor Bittorf , Akshay Chaudhari , Alexander Chowdhury

分类：机器学习

2021-09-29

医疗AI通过支持基于证据的医学实践，个性化患者治疗，降低成本以及改善提供者和患者体验，推进医疗保健的巨大潜力。我们认为解锁此潜力需要一种系统的方法来衡量在大规模异构数据上的医疗AI模型的性能。为了满足这种需求，我们正在建立Medperf，这是一个开放的框架，用于在医疗领域的基准测试机器学习。 Medperf将使联合评估能够将模型安全地分配给不同的评估设施，从而赋予医疗组织在高效和人类监督过程中评估和验证AI模型的性能，同时优先考虑隐私。我们描述了当前的挑战医疗保健和AI社区面临，需要开放平台，Medperf的设计理念，其目前的实施状态和我们的路线图。我们呼吁研究人员和组织加入我们创建Medperf开放基准平台。

translated by 谷歌翻译

SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts

Arie Cattan , Sophie Johnson , Daniel Weld , Ido Dagan , Iz Beltagy , Doug Downey , Tom Hope

分类：自然语言处理 | 机器学习

2021-04-18

确定多个文档的概念提及的练习是自然语言理解中的基本任务。以前关于跨文档Coreference解析（CDCR）的工作通常会考虑新闻中的事件提到，这很少涉及普遍存在的科学和技术的技术概念。这些复杂的概念采用不同的形式或含糊不清的形式，并且具有许多分层级别的粒度（例如，任务和子组织），构成了CDCR的挑战。我们呈现了分层CDCR（H-CDCR）的新任务，其目标是在它们之间联合推断COREREFER集群和层次结构。我们在科学论文中创建SciCo，一个专家注释的H-CDCR数据集，比突出的欧洲ecb +资源大3倍。我们研究了我们为H-CDCR定制的强大基线模型，并突出了未来工作的挑战。

translated by 谷歌翻译

Information Aided Navigation: A Review

Daniel Engelsman , Itzik Klein

分类：机器人

2023-01-03

The performance of inertial navigation systems is largely dependent on the stable flow of external measurements and information to guarantee continuous filter updates and bind the inertial solution drift. Platforms in different operational environments may be prevented at some point from receiving external measurements, thus exposing their navigation solution to drift. Over the years, a wide variety of works have been proposed to overcome this shortcoming, by exploiting knowledge of the system current conditions and turning it into an applicable source of information to update the navigation filter. This paper aims to provide an extensive survey of information aided navigation, broadly classified into direct, indirect, and model aiding. Each approach is described by the notable works that implemented its concept, use cases, relevant state updates, and their corresponding measurement models. By matching the appropriate constraint to a given scenario, one will be able to improve the navigation solution accuracy, compensate for the lost information, and uncover certain internal states, that would otherwise remain unobservable.

translated by 谷歌翻译

Faster Approximate Dynamic Programming by Freezing Slow States

Yijia Wang , Daniel R. Jiang

分类：人工智能 | 机器学习

2023-01-03

We consider infinite horizon Markov decision processes (MDPs) with fast-slow structure, meaning that certain parts of the state space move "fast" (and in a sense, are more influential) while other parts transition more "slowly." Such structure is common in real-world problems where sequential decisions need to be made at high frequencies, yet information that varies at a slower timescale also influences the optimal policy. Examples include: (1) service allocation for a multi-class queue with (slowly varying) stochastic costs, (2) a restless multi-armed bandit with an environmental state, and (3) energy demand response, where both day-ahead and real-time prices play a role in the firm's revenue. Models that fully capture these problems often result in MDPs with large state spaces and large effective time horizons (due to frequent decisions), rendering them computationally intractable. We propose an approximate dynamic programming algorithmic framework based on the idea of "freezing" the slow states, solving a set of simpler finite-horizon MDPs (the lower-level MDPs), and applying value iteration (VI) to an auxiliary MDP that transitions on a slower timescale (the upper-level MDP). We also extend the technique to a function approximation setting, where a feature-based linear architecture is used. On the theoretical side, we analyze the regret incurred by each variant of our frozen-state approach. Finally, we give empirical evidence that the frozen-state approach generates effective policies using just a fraction of the computational cost, while illustrating that simply omitting slow states from the decision modeling is often not a viable heuristic.

translated by 谷歌翻译

OF-AE: Oblique Forest AutoEncoders

Cristian Daniel Alecsa

分类：机器学习

2023-01-02

In the present work we propose an unsupervised ensemble method consisting of oblique trees that can address the task of auto-encoding, namely Oblique Forest AutoEncoders (briefly OF-AE). Our method is a natural extension of the eForest encoder introduced in [1]. More precisely, by employing oblique splits consisting in multivariate linear combination of features instead of the axis-parallel ones, we will devise an auto-encoder method through the computation of a sparse solution of a set of linear inequalities consisting of feature values constraints. The code for reproducing our results is available at https://github.com/CDAlecsa/Oblique-Forest-AutoEncoders.

translated by 谷歌翻译